<div> MODERN APPROACHES TO MULTICLASS INTENT CLASSIFICATION BASED ON PRE-TRAINED TRANSFORMERS</div> Open database of scientific publications ITMO UNIVERSITY

MODERN APPROACHES TO MULTICLASS INTENT CLASSIFICATION BASED ON PRE-TRAINED TRANSFORMERS

Journal

Scientific and technical journal of information technologies, mechanics and optics

Artem A. Solomin, Yuliya A. Ivanova (Bolotova)

UDK004.8

Issue:4 (128)

Download PDF0 Kbyte

Annotation

Subject of Research. The paper considers modern approaches to the multiclass intention classiﬁcation problem. The user intention is the incoming user requests when interacting with voice assistants and chatbots. The algorithm is meant for determination what class the call belongs to. Modern technologies such as transfer learning and transformers can improve signiﬁcantly the multiclass classiﬁcation results. Method. This study uses a comparative model analysis technique. In turn, each model is inlined into a common pipeline for data preparing and clearing, and the model training but with regard to its speciﬁc requirements. The following models applied in real projects have been selected for comparison: Logistic Regression + TF-IDF, Logistic Regression + FastText, LSTM + FastText, Conv1D + FastText, BERT, and XLM. The sequence of models corresponds to their historical origin, but in practice these models are used without regard to the time period of their creation but depending on the effectiveness of the problem being solved. Main Results. The effectiveness of the multiclass classiﬁcation models on real data is studied. Comparison results of modern practical approaches are described. In particular, XLM conﬁrms the superiority of transformers over other approaches. An assumption is made considering the reason why the transformers show such a gap. The advantages and disadvantages of modern approaches are described. Practical Relevance. From a practical point of view, the results of this study can be used for projects that require automatic classiﬁcation of intentions, as part of a complex system (voice assistant, chatbot or other system), as well as an independent system. The pipeline designed during the study can be applied for comparison and selection of the most effective model for speciﬁc data sets, both in scientiﬁc research and production.

MODERN APPROACHES TO MULTICLASS INTENT CLASSIFICATION BASED ON PRE-TRAINED TRANSFORMERS

Scientific and technical journal of information technologies, mechanics and optics

Annotation

Keywords

Постоянный URL

Articles in current issue

MODERN APPROACHES TO MULTICLASS INTENT CLASSIFICATION BASED ON PRE-TRAINED TRANSFORMERS

Scientific and technical journal of information technologies, mechanics and optics

Annotation

Keywords

Постоянный URL

Поделиться

Articles in current issue